Roulette-wheel selection via stochastic acceptance 
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Roulette-wheel selection is a frequently used method in genetic and evolutionary algorithms or in 
modeling of complex networks. Existing routines select one of N individuals using search algorithms 
of O(N) or 0(log N) complexity. We present a simple roulette-wheel selection algorithm, which 
typically has O(l) complexity and is based on stochastic acceptance instead of searching. We also 
discuss a hybrid version, which might be suitable for highly heterogeneous weight distributions, 
found, for example, in some models of complex networks. With minor modifications, the algorithm 
might also be used for sampling with fitness cut-off at a certain value or for sampling without 
replacement. 
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I. INTRODUCTION 



Finding low-energy configurations of crystals [l| , disor- 
dered magnets [1] or proteins [H Reconstructing geologi- 
cal structure from seismic data [J], and analysing X-ray 
data [f| are just a few examples of optimization prob- 
lems in physical sciences @. Various techniques have 
been developed to approach these problems and one of 
the most frequently used is genetic algorithms [7j. Their 
basic idea is to mimic the way biological evolution cre- 
ates apparrently better fitted species. To do so one needs 
to represent a pool of optimization methods (routines, 
functions, strategies, etc.) as a population of individu- 
als, which, as in nature, is subjected to two, in a sense 
opposing, processes. On the one hand, due to mutations 
or crossing-over operations, the variability in the popula- 
tion increases. On the other hand, to guide the evolution 
in a desired direction, one has to trim the population 
with some selection mechanisms. It turns out that such 
a biology-inspired scheme allows us to find optimal or 
nearly optimal solutions of various problems in a very 
efficient way 

Selection is an important part of genetic algorithms 
since it affects significantly their convergence. The basic 
strategy follows the rule: The better fitted an individ- 
ual, the larger the probability of its survival and mating. 
The most straightforward implementation of this rule is 
the so-called roulette- wheel selection Q. This method 
assumes that the probability of selection is proportional 
to the fitness of an individual. It can be briefly described 
as follows. Let us consider N individuals, each character- 
ized by its fitness Wi > (i = 1, 2, . . . , N). The selection 
probability of the i-th individual is thus given as 
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Let us imagine a roulette wheel with sectors of size pro- 
portional to Wi (i — 1, 2, . . . , N). Selection of an individ- 
ual is then equivalent to choosing randomly a point on 
the wheel and locating the corresponding sector (Fig. [T]) . 
When a simple search is used, such a location requires 
O(N) operations while the binary search needs 0(log N) 
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FIG. 1. In a roulette-wheel selection, one constructs a line 
segment of length Y^%=i Wi ou ^ of consecutive sectors of length 
Wi (i = 1, 2, . . . , iV), generates a random number r such that 
< r < ^2 i=1 Wi, and locates the corresponding sector (ws 
in this case), thus selecting the respective individual. When 
a simple search of an iV-element list is used for location, the 
procedure requires O(N) operations, which can be reduced to 
O(logiV) by means of the binary search. 



operations. 

There are also other methods with the selection proba- 
bility depending on the fitness, such as stochastic remain- 
der or stochastic universal selection [3, |T0|. They have 
slightly different statistical properties and in general lead 
to populations of larger variability. In yet another class 
of selection methods, the ranking or ordering of individ- 
uals rather than their fitness plays a central role 
It is certainly difficult to indicate which of these meth- 
ods is the best [HI, [l3[ . Although some studies suggest 
that fitness-based selection methods suffer from certain 
scaling- related problems [Tij ]. there are some works that 
seem to alleviate this difficulty fl5j . 

Let us notice that the roulette-wheel selection is also 
used in the generation of complex networks. For exam- 
ple, in some models of growing networks, a newly added 
site is linked to one of the already existing sites with 
probability proportional to the degree of this old site fisl — 
|l8j. This preferential-attachment algorithm is known to 
generate highly heterogeneous scale-free networks, which 
recently have been intensively studied [l9J . The roulette- 
wheel selection is also used in certain adaptive network 
models H. 
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Due to simplicity of implementation and straightfor- 
ward interpretation, the roulette-wheel selection is fre- 
quently used in genetic algorithms. Although the binary 
search significantly reduces computational complexity, 
still faster implementations would be, in our opinion, de- 
sirable. In the present paper, we show that the roulette- 
wheel selection can be realized with a simple algorithm of 
typically O(l) complexity. The proposed algorithm does 
not use searching but is based on a stochastic acceptance 
of a randomly selected individual. 

II. DESCRIPTION AND PROPERTIES OF THE 
ALGORITHM 

Our algorithm consists of the following steps: 

1. Select randomly one of the individuals (say, i). The 
selection is done with uniform probability (I/TV), 
which does not depend on the individual's fitness 
Wi (Fig. HI). 

2. With probability Wi/w max , where w max = 
m&x{wi}fL 1 is the maximal fitness in the popula- 
tion, the selection is accepted. Otherwise, the pro- 
cedure is repeated from step 1 (i.e., in the case of 
rejection, another selection attempt is made). 

Of course, the probability that the i-ih individual will 
be selected in a single attempt equals Wi/(Nw max ). How- 
ever, since several first attempts might fail, one has to 
calculate the probability that the individual selected in 
an unspecified number of attempts will be the i-th indi- 
vidual. Straightforward calculations give 

P> = -^(l + q + cf + ...) (2) 

where 

1 N I \ 

?=^£(W^a*) = 1-^^ = 1--^ (3) 

A r-T< «Wmax Wmax 

is the rejection probability and (w) = (2i=i Wi)/N is the 
average fitness. The geometrical series ([2]) is convergent 
(0 < q < 1) and summing it up, we easily find that 

Pi = ft (4) 

where pi is defined in Eq. (JTJ) . This shows that the prob- 
ability distribution of our procedure is indeed the same 
as in the roulette-wheel selection. 

Similarly, we can calculate the average number of at- 
tempts r needed for a single selection of any individual. 
One obtains 

1 N 

- = ^E^[ 1 + 2 « + 3? 2 + .--] = ^ (5) 
N f-^ w max (w) 

Although r, which determines the computational com- 
plexity of our algoritmm, does not depend explicitly on 
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FIG. 2. In the proposed algorithm, one selects randomly 
(with equal probability l/N) an individual (say, i) and ac- 
cepts such a selection with probability Wi/w max , where w ma x 
is the maximal fitness. In case of rejection, the procedure 
is repeated anew (i.e., a new individual is selected repeat- 
edly until acceptance). Although it requires extra call(s) of 
a random-number generator, it remains typically O(l) algo- 
rithm. 



A, the ratio might change with A, depending on 

the specificity of the problem. We expect, however, that 
in many applications, for example, those where fitness 
remains bounded (wi < B) and (w) does not diminish to 
for increasing A, the ratio does not increase un- 
boundedly with A and thus a typical complexity of our 
algorithm should be 0(1). 

One can examine a more general algorithm where ac- 
ceptance is made with probability Wi/A, where A > u> ma x 
is a certain constant. For such an algorithm the selection 
probability p\ also satisfies Eq. (j4j. However, a smaller 
acceptance probability results in a larger rejection prob- 
ability and the efficiency of the algorithm is reduced (as 
t increases). For A < w mscx , some of the fractions Wi/A 
may turn out greater than unity and the equality (|4]) 
does not hold. Thus, the choice A = u> max ensures opti- 
mal performance. 

To examine its performance, we applied our algorithm 
to a population of A individuals with fitness being a uni- 
formly distributed random number < lOj < 1. We 
implemented the roulette-wheel selection algorithms us- 
ing linear or binary search as well as our stochastic ac- 
ceptance method. We found that the distributions of 
selected individuals generated by these programs were 
within statistical errors the same. Average execution 
time per single selection as a function of N is shown in 
Fig. [3J As expected, the linear and binary search show 
O(N) and 0(logN) behaviour, respectively. Our algo- 
rithm requires basically A^-independent CPU time and a 
slight increase for large N is due to excessive memory re- 
quirements that exceeded the size of cache memory. We 
also measured the average number of attempts t. In our 
numerical example, u> max « 1 for large A, and (w) ~ 1/2. 
Thus, according to Eq. ©, t « 2, and our numerical data 
are in excellent agreement with this result (Fig. [3]). 
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FIG. 3. The average CPU time t (in [is) of roulette-wheel 
selection of a single individual as a function of the popula- 
tion size N (log-log scale). As expected, the linear search 
has the complexity O(N) and the binary search 0(log(AT)). 
The stochastic acceptance algorithm behaves as O(l) and a 
slight increase for the largest N is due to excessive memory 
requirements that exceeded the size of cache memory. Calcu- 
lations were made on the Intel Xeon 2.8GHz processor. Filled 
squares show the average number of attempts per selection. 



III. POSSIBLE EXTENSIONS AND 
CONCLUSIONS 

As we have already mentioned in Introduction, the 
roulette-wheel selection is used in some models of com- 
plex networks that are based on preferential attachment. 
The networks generated in this way are strongly inho- 
mogeneous with a small fraction of sites having a very 
large degree. The distribution of weights, which are 
proportional to degrees, is thus very broad and the ra- 
tio might be quite large. Consequently, the ef- 
ficiency of our algorithm might diminish. However, a 
simple modification of the algorithm might lead to a 
better performance. First, let us assume that there is 
one weight that is much larger than the others, say 
W\ (= u> max 3> Wi, i — 2,3, ... N). In this case we 
might use the following hybrid algorithm, which com- 
bines search and stochastic acceptance: With probabil- 



ity pi = Wi/y^--, Wi one selects the first individual and 
with probability 1 —p\ one of the remaining N— 1 individ- 
uals (using roulette- wheel applied to N — 1 weights). In 
the latter step, the stochastic acceptance should be quite 
efficient since the largest weight w\ was removed. The 
probability of selection of the individual i(> 1) equals 
(1 — pi) ^aT* — = ■?=nv £i — , thus it is indeed equal to (J]). 

Generalization to the case where there are several much 
larger weights is straightforward. 

To ensure a sufficiently large variability of the popula- 
tion, it is sometimes desirable in optimization problems 
to use sampling without replacement. In order to guar- 
antee that a once selected individual is never selected 
again, one can simply set its fitness to 0. However, when 
the individual happened to have the maximal fitness, a 
new maximum should be found. Sacrifying slightly the 
efficiency, one can use the old maximum to calculate the 
acceptance probability Wi/w max . Similarly, when fitness 
of individuals is known to be bounded (tUj < B), it is not 
needed to keep track of the current maximum, as one 
can use Wi/B as the acceptance probability. Although 
the efficiency of the algorithm is reduced, it still should 
remain of the 0(1) type pll |. 

The replacement of u> max with a certain constant A < 
io mal in our algorithm might be yet another way of in- 
creasing variability in the population. Indeed, in such a 
case individuals are selected according to Eq. ([T]) but with 
their fitness cut-off at A (i.e., w[ — mm{wi, A}). Our al- 
gorithm might also be adapted to evolving systems, such 
as, for example, complex adaptive systems where fitness 
of some or all individuals changes in time. Let us empha- 
size that the performance of a selection method depends 
on a number of factors and also on a particular type of 
the optimization problem [l3j]. More detailed analysis of 
the efficiency of our algorithm in comparison with other 
selection methods will be presented elsewhere. 

In conclusion, we have shown that Holland's original 
idea of using fitness-proportionate selection, i.e., the so- 
called roulette-wheel selection, can be formulated as an 
algorithm of typically 0(1) complexity. The numeri- 
cal example shows that for sizes of populations used in 
genetic-algorithm applications, ranging from 10 2 to 10 4 , 
our algorithm offers a significant CPU gain over exist- 
ing routines based on a linear or binary search. The 
algorithm is very simple and we expect that it can be 
modified and used in even more sophisticated selection 
schemes. 
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